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Chromatin structure plays a key role in regulating gene expression and embryonic differentiation; however, the factors that de- 
termine the organization of chromatin around regulatory sites are not fully known. Here we show that HMGN1, a nucleosome- 
binding protein ubiquitously expressed in vertebrate cells, preferentially binds to CpG island-containing promoters and affects 
the organization of nucleosomes, DNase I hypersensitivity, and the transcriptional profile of mouse embryonic stem cells and 
neural progenitors. Loss of HMGN1 alters the organization of an unstable nucleosome at transcription start sites, reduces the 
number of DNase I-hypersensitive sites genome wide, and decreases the number of nestin-positive neural progenitors in the 
subventricular zone (SVZ) region of mouse brain. Thus, architectural chromatin-binding proteins affect the transcription pro- 
file and chromatin structure during embryonic stem cell differentiation. 



Embryonic stem cell (ESC) pluripotency and differentiation 
are controlled by the coordinated action of multiple epige- 
netic factors that affect the structure of chromatin and regulate 
gene expression. These include transcription factors, chroma- 
tin remodeling complexes, and additional proteins that alter 
the properties of chromatin such as linker histone HI variants 
and members of the high-mobility-group (HMG) protein su- 
perfamily (1-5). 

Histone HI variants and HMG protein are ubiquitously found 
in the nuclei of all vertebrate cells. These structural proteins bind 
dynamically to nucleosomes without any known DNA sequence 
specificity (6,7), modulate the local and global structures of chro- 
matin, and affect gene expression. Histone HI represents the most 
abundant family of chromatin-binding proteins; most nuclei con- 
tain sufficient HI molecules to bind to all the nucleosomes. The 
interaction of HI with nucleosomes stabilizes the compact, high- 
er-order chromatin structure (8). Vertebrate nuclei contain 3 
families of HMG proteins, named HMGA, HMGB, and HMGN, 
each of which induces specific alterations in DNA or chromatin 
structure (9, 10). The dynamic binding of HMGs to nucleosomes 
reduces the interaction of HI with chromatin and can counteract 
the chromatin-condensing activity of HI (7, 11). Thus, the inter- 
play between HI and HMG proteins could play a role in chroma- 
tin dynamics and function (12). Although extensively studied, the 
cellular function and mechanism of action of these chromatin 
architectural proteins, and the extent to which they can affect the 
properties of ESCs, are not fully elucidated. 

Here we examine the role of HMGN1, a major variant of the 
HMGN protein family, in regulating the chromatin structure and 
properties of mouse embryonic stem cells (ESCs) during their 
differentiation along the neuronal pathway. HMGNs could affect 
chromatin structure and ESC differentiation because they bind 
specifically to nucleosome core particles (CPs) ( 13, 14), affect the 
levels of histone modifications (15), and have been linked to the 
generation of DNase I-hypersensitive sites (DHS) (16), the hall- 



mark of chromatin regulatory sites. HMGNs counteract the activ- 
ity of certain nucleosome remodeling complexes (17) and thus 
could affect nucleosome positioning (18, 19). In human T cells, 
HMGN1 preferentially localizes to chromatin regulatory regions 
such as promoters, enhancers, and nucleosomes bordering CTCF 
sites (20); in tissue culture cells, either up- or downregulation of 
HMGN1 levels alters the cellular transcription profile (21). Fur- 
thermore, the expression of HMGN genes is related to differenti- 
ation processes such as erythropoiesis, myogenesis, osteoblast 
differentiation, kidney organogenesis, preimplantation develop- 
ment of early mouse embryo, and Xenopus embryogenesis (22). 
Transient depletion of HMGN proteins from one- or two-cell 
mouse embryos slowed the progression of preimplantation devel- 
opment (23), and altered HMGN expression during Xenopus em- 
bryogenesis led to malformed tadpole embryos (24). Taken to- 
gether, these results suggest a link between regulated expression of 
HMGN proteins and cellular differentiation. 

To test whether HMGN1 could affect ESC differentiation in 
the biological context of a whole organism, we first verified that 
the protein is expressed during preimplantation development 
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(23). We found robust HMGN1 expression throughout all these 
developmental stages and therefore generated ESCs from 
Hmgnl ~'~ mice, induced them to differentiate along the neuronal 
pathway, and compared their chromatin structure and transcrip- 
tion profiles to those of Hmgnl +I+ ESCs. We found strong pref- 
erential binding of HMGN1 to promoters containing CpG islands 
(CGI), especially when these were transcriptionally active. Signif- 
icantly, using limiting micrococcal nuclease digests, we detected 
an unstable nucleosome positioned at the transcription start sites 
(TSS) of CGI promoters, in a region considered to be "nucleo- 
some depleted" (25-27). The stability of these nucleosomes is re- 
lated to transcription and affected by HMGNl. Likewise, we 
found that HMGNl colocalizes with and affects the hypersensi- 
tivity of DHS at the CGI promoters of ESCs. Loss of functional 
HMGNl did not noticeably affect ESC differentiation but led to 
changes in the cellular transcription profiles of neural progenitor 
cells (NPCs) and neurons. In mouse brain, loss of HMGNl led to 
a decrease of nestin-positive NP cells in the subventricular stem 
cell zone. 

Our report provides additional information on the chromatin 
landscape of ESCs and NPCs and identifies HMGNl as a protein 
that affects chromatin structure at the transcription start sites of 
promoters containing CpG islands of ESCs and that modulates the 
number of nestin-positive cells in mouse brain. 

MATERIALS AND METHODS 

ES cell culture and in vitro differentiation. Hmgnl +I+ and Hmgnl~'~ 
ESC lines were derived from littermate 3.5-day blastocytes obtained from 
Hmgnl +, ~ and Hmgnl +I ~ mice matings (28). In Hmgnl~'~ cells, we 
deleted exons II to IV, which code for the nucleosome-binding domain of 
the protein (29). The ES cells were cocultured with mitomycin C-treated 
primary mouse embryonic fibroblast (MEF) feeders (Millipore) or cul- 
tured under feeder-free conditions; they were maintained in Knockout 
Dulbecco's modified Eagle's medium (KO-DMEM; Invitrogen), with 
20% serum replacement (SR; Invitrogen), 0.055 mM (3-mercaptoethanol 
(Sigma), 2 mM L-glutamine (Invitrogen), 0.1 mM MEM nonessential 
amino acid (Invitrogen), 5,000 U/ml penicillin-streptomycin (Invitro- 
gen), and 1,000 U/ml leukemia inhibitory factor (LIF) (Millipore). Cells 
were passaged every 2 days using 0.05% trypsin-EDTA. In vitro differen- 
tiation of ES cells according to neural lineage was performed using a 
mouse dopaminergic neuron differentiation kit (R&D Systems) accord- 
ing to the manufacturer's instruction. 

RNA and cDNA preparation. Equal amounts of cells were used for 
total RNA isolation by the use of an RNeasy Mini kit ( Qiagen) followed by 
"on-column" DNase I treatment. RNA (500 ng) was used for cDNA syn- 
thesis by iScript kit (Bio-Rad). Equal amounts of cDNA were applied for 
real-time PCRs with Power SYBR green mix (Applied Biosystems). (3-Ac- 
tin served as an internal control. Reactions and measurements were per- 
formed using an AB 7900HT Fast real-time PCR system and software. 

Antibodies, Western blotting, and immunostaining. Rabbit affinity 
pure polyclonal anti-mouse HMGNl antibodies were generated in our 
laboratory (29). Antiactin antibody was from Sigma (A5316). Anti- 
Oct3/4 antibody was from Santa Cruz (sc-5279), antinestin antibody was 
from Millipore (MAB-353), and anti-tyrosine hydroxylase (anti-Th) an- 
tibody was from Tel-Freez (P40101). Horseradish peroxidase (HRP)-con- 
jugated secondary antibodies for Western blots were from Pierce. Fluo- 
rescence secondary antibodies were Alexa Fluor 488 -donkey anti-rabbit 
IgG and Alexa Fluor 568-donkey anti-mouse IgG (Invitrogen). For 
Western blots, whole-cell lysates were prepared in 1 X SDS-PAGE sample 
buffer (Bio-Rad) supplemented with protease inhibitors (Roche Applied 
Science). The samples were fractionated on 15% precast Criterion gels, 
transferred by the semidry method to a polyvinylidene difluoride (PVDF) 
membrane, blocked with nonfat milk in Tris-buffered saline-Tween 20 



(TBST), and probed with antibodies. Chemiluminiscent detection using 
ECL Plus has been performed according to Amersham recommendations. 

For immunostaining, ES cells, neural progenitors, and neurons were 
grown on coverslips under the conditions described above. Frozen brain 
sections (30 u.m thick) were prepared from 2-week-old Hmgnl +I+ and 
Hmgnl~ : ~ mice. Immunofluorescence staining was performed using the 
corresponding antibodies showed in the figures as previously described 

(30) . The images from cultured cells were taken using a Nikon Eclipse 
E800 fluorescence microscope and processed using NIS-Elements soft- 
ware (Nikon). The images from the brain sections were taken using a Zeiss 
LSM 710 confocal system and were processed using Zeiss software. 

Dissection of mouse SVZ region. The whole brains of 2-week-old 
male Hmgnl +,+ and Hmgnl~'~ mice were collected. Sections (1 mm 
thick) were cut using a mouse brain mold and snap-frozen on glass slides 
on dry ice. The subventricular zone (SVZ) regions from all corresponding 
sections were dissected using no. 5 tweezers (Dumont) under a dissection 
microscope and pooled. 

Transcription profiling. Microarray expression analysis of total RNA 
from ES cells, neural progenitors, and neurons was performed using Af- 
fymetrix Mouse GeneChips 430 2 (430V2) as described previously (21). 
Three replicates of Hmgnl + and Hmgnl ESCs, NPCs, and neurons 
were generated. All analysis was performed using R and BioConductor 

(31) . The quality of the arrays were evaluated with R package version 
1.18.0 "affy" (32) and "simpleaffy" (33). Expression levels were derived 
using the Robust Multichip Average (RMA) protocol (34) with default 
settings. Differentially expressed genes were identified using an empirical 
Bayes method implemented in the R package "Limma" (34). P values were 
corrected for multiple testing using a false-discovery-rate method. Genes 
that have a llog 2 (fold change) I value a 1 and for which the adjusted P 
value is S0.01 are considered differentially expressed between two condi- 
tions (i.e., between Hmgn 1 + andHmgnl~'~) or between two differen- 
tiation states (i.e., NP versus ES or neuron versus NP). 

ChlP-Seq. About 2 X 10 7 cells (feeder-free ES cells or neural progen- 
itors) were cross-linked directly in culture medium with 1 % formalde- 
hyde solution for 10 min at room temperature. Chromatin was frag- 
mented to 200 to 500 bp by sonication (Bioruptor). Chromatin 
immunoprecipitation (ChIP) experiments were performed using Dyna- 
beads ( Invitrogen) and affinity-purified anti-HMGN 1 . For preparation of 
sequencing libraries, 20 ng ChIP DNA or input DNA (samples reserved 
before ChIP) were end repaired using an End-It DNA end repair kit (Epi- 
centre), and then an "A" was added to the 3' end using Klenow (3'-5' 
exo-) (NEB). After ligation with Illumina adaptors, 200-to-300-bp DNA 
fractions were size selected on a 2% E-gel (Invitrogen), extracted, and then 
amplified by 1 8 cycles of PCR using Illumina primers and Phusion DNA 
polymerase (NEB). Libraries were purified by the use of Agencourt Am- 
pure beads (Beckman Coulter) and then sequenced on Illumina Genome 
Analyzer IIx with 36-bp single-end reads. 

DNase I hypersensitivity assays. For genome-wide analysis, intact 
nuclei were isolated from trypsinized cells by incubation on ice for 10 min 
in buffer containing 0.02% IGEPAL CA-630 (Sigma), 1 mM EDTA, 0.5 
mM EGTA, 0.5 mM spermidine, and protease inhibitor (Roche Applied 
Science). DNase I digestion was carried out by incubation of 2 X 10 7 
nuclei with 40 U/ml DNase I (Roche Applied Science) at 37°C for 3 min. 
After treatment by RNase A and proteinase K, DNA was recovered by 
phenol-chloroform extraction. DNA fragments with sizes between 150 
and 500 bp were isolated by 9% sucrose gradient centrifugation. Con- 
struction of libraries and sequencing were performed the same way as 
described above for the ChlP-Seq samples. DNase I accessibility assays 
were performed to validate the genome-wide analysis. DNase I digestions 
were done using 40, 60, or 80 U/ml DNase I. The digested DNA was 
precipitated, and equal amounts of DNA were analyzed by quantitative 
real-time PCR as described previously. Primers are available upon re- 
quest. 

Genome-wide nucleosome-positioning assay. Mononucleosomal 
DNAs from wild-type and Hmgnl^ 1 ^ ES cells and neural progenitors 
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were isolated by micrococcal nuclease digestion using an EZ Nucleosomal 
DNA Prep kit (Zymo Research) according to the manufacturer's instruc- 
tion. In brief, nuclei were isolated from trypsinized cells by incubation on 
ice for 5 min in Nuclei Prep buffer. Nuclei (1.5 X 10 6 ) were then digested 
using 0.05 to 0.5 unit of micrococcal nuclease in 100 u.1 MN Digestion 
buffer at room temperature for 10 min. Nucleosomal DNA was purified 
using the columns supplied with the kit and then was then processed on 
2% agarose gel to check the size. Mononucleosomal DNA was selected for 
construction of Illumina sequencing libraries. Library preparation and 
sequencing were performed as described above for ChlP-Seq. 

Bioinformatic ChlP-Seq, DNase I-Seq, and MNase-Seq analyses. 
The Mouse Genome 430 2.0 array has 45,101 probe sets associated with 
19,824 Mouse Genome Informatics (MGI) gene identifiers. Of these, 
17,392 are experimentally validated and are listed in the NCBI RefSeq 
"NM" gene database. We used these coordinates for TSS analysis. Probe 
sets were mapped to MGI identifiers using information provided by the 
Jackson Laboratory (http://www.informatics.jax.org/). Unfiltered se- 
quencing reads (from ChlP-Seq, DNase I-Seq, or MNase-Seq) were 
aligned to the mouse reference genome (NCBI build 37, mm9) using 
Bowtie (35). Up to 1 mismatch was allowed for each aligned read. Only 
uniquely aligned reads were collected for further analysis. Binding regions 
were identified using SICER (36) with the following parameters: effective 
genome size, 0.787 (78.7% of the mouse genome is mappable); window 
size, 100 bp for HMGN ChlP-Seq and 50 bp for DNase I-Seq; gap size, 100 
bp for HMGN ChlP-Seq and 50 bp for DNase I-Seq. The SICER score 
in the graph can be viewed as an indication of peak intensity. Calcula- 
tion of coverage and identification of overlapping binding regions 
were performed with the "chipseq" and "GenomicRanges" packages in 
BioConductor (31). Specifically, in analyzing MNase-seq data, since the 
reads are from the ends of the mononucleosome particle, to best visualize 
the nucleosome positions, all the reads were shifted to the middle of the 
corresponding nucleosome core. The average sizes of mononucleosome 
fragments purified from a 2% agarose gel were determined by Bioanalyzer 
(Agilent) . For normalization, the calculation of coverage at any regions 
and the comparisons between different data sets were preceded by library 
size normalization. Control subtraction was carried out in the following 
way: coverage (exp)/Nl — coverage (control)/N2, in which "exp" is the 
data set (in .bam format) to be examined, Nl is the library size of the 
experimental data ("exp"), and N2 is the library size of the control. In this 
study, input sequences (DNA sequences after sonication only without 
immunoprecipitation) were used as a control. The function coverage that 
calculates genome coverage from .bam files is from the "chipseq" package 
in BioConductor (31). 

Microarray data accession numbers. The results of the analysis of the 
transcription changes during differentiation have been deposited at the 
Gene Expression Omnibus (GEO) database under accession no. 
GSE44175. We have deposited the BAM files in the SRA database (acces- 
sion number SRA066756). 

RESULTS 

Differentiation of Hmgnl -1 ~ embryonic stem cells along the 
neural lineage. To test whether HMGN1 could affect ESC differ- 
entiation, we first reexamined the HMGN1 expression pattern 
during mouse early embryonic differentiation (23). We found ro- 
bust HMGN1 expression in oocytes and throughout all preim- 
plantation stages, including the blastocyst stage (Fig. 1A), raising 
the possibility that HMGN1 affects the chromatin structure and 
differentiation potential of ESCs. We therefore established ESCs 
from Hmgnl +I+ and Hmgnl~'~ mice. Western blots (Fig. 1C) 
and immunofluorescence (Fig. ID) verified that the Hmgnl~'~ 
ESCs lacked intact HMGN1 protein. Loss of HMGN1 did not 
affect the viability or morphology of the ESCs. The Hmgnl~'~ 
ESC colonies, just like the Hmgnl +/+ colonies, remained undif- 
ferentiated, as indicated by staining for alkaline phosphatase 



(Fig. IE), immunostaining for Oct3/4 (Fig. ID), and reverse tran- 
scription-PCR (RT-PCR) analysis of Nanog levels (Fig. 1H). 

To test whether loss of HMGN1 affects the differentiation po- 
tential of the ESCs, we induced differentiation along the neuronal 
pathway (Fig. IB). Upon removal of LIF, both the Hmgnl ~'~ and 
Hmgnl +l+ ESCs formed embryoid bodies which could be further 
induced to differentiate into nestin-expressing NPCs as indicated 
by immunofluorescence and quantitative RT-PCR (Fig. IF and 
H). Further differentiation toward dopaminergic neurons re- 
sulted in cultures in which most cells developed axons and den- 
drites, a morphology characteristic of adult neurons (Fig. IB; see 
also Fig. SI in the supplemental material). Immunostaining for 
tyrosine hydroxylase (TH) revealed that about half of the neurons 
were dopaminergic neurons (Fig. 1G). The mRNA expression pat- 
terns of Nanog, Nestin, and tropomyosin-related kinase B ( TrkB), 
which serve as differentiation markers for ESCs, NPCs, and adult 
neurons, respectively, further confirmed that the Hmgnl ~'~ ESCs 
differentiated properly along the neuronal pathway (Fig. 1H). 

HMGN1 depletion affects gene expression during ES cell dif- 
ferentiation. Alterations in HMGN levels can affect gene expres- 
sion, leading to phenotypes that are not immediately obvious (22, 
29, 37-39). Therefore, we compared the transcription profiles of 
Hmgnl +I+ ESCs, NPCs, and neurons to those of Hmgnl~ cells 
at the same three differentiation stages. Loss of HMGN 1 altered 
significantly the expression of 19, 169, and 85 genes in ESCs, 
NPCs, and neurons, respectively (Fig. 2A and B; see also Table S1A 
in the supplemental material). Quantitative RT-PCR (Q-PCR) 
analysis of 9 upregulated and 9 downregulated genes (the 3 that 
were most changed, 3 that were moderately changed, and the 3 
that were least changed) at each differentiation stage verified the 
microarray results (see Table S2 in the supplemental material). 
Gene ontology analysis performed using DAVID (40) for all the 
genes with altered expression revealed that genes involved in de- 
velopmental processes, especially in organ morphogenesis, vascu- 
lature development, and tissue development, were the genes most 
severely affected (see Table S3 in the supplemental material). 

Progression through the differentiation stages leading from 
ESCs to neurons involves major changes in the expression of nu- 
merous genes. In both Hmgnl +I+ and Hmgnl~'~ cells, close to 
4,000 genes showed expression level changes during the switch 
either from ESCs to NPCs or from NPCs to neurons. Over 85% of 
these changes involved the same genes in both Hmgnl +/+ and 
Hmgnl ~'~ cells. In Table SIB and C in the supplemental material, 
we list the gene names and their fold change values in both 
Hmgnl +I+ or Hmgnl -1 ~ cells. In the experiment represented by 
Fig. 2C, we first sorted the genes based on their fold changes in 
expression levels in Hmgnl +l+ cells and then plotted these sorted 
genes versus the corresponding fold changes in both Hmgnl +/+ 
(blue) and Hmgnl~'~ (red) cells. The differences between red 
crosses and blue dots indicate that the magnitude of the differen- 
tiation-related changes in gene expression was affected by loss of 
HMGN1 (Fig. 2C). These changes are not due to variations be- 
tween microarray replicates, because each point represents the 
fold change calculated from 3 replicates from each cell type, 
whereas separate control analyses of all the possible versions of the 
2 replicates revealed that the variations within the Hmgnl~'~ 
or Hmgnl +/+ groups were very small (see Fig. S2 in the sup- 
plemental material). The mean variation between Hmgnl +/+ 
and Hmgnl -1- cells was more than 4-fold bigger than the varia- 
tion within duplicates (0.327 versus 0.067 [P value < 2.2e-16; 



August 2013 Volume 33 Number 16 



mcb.asm.org 3379 



Deng et al. 



B 



^ ^ 











200 pm 200 pm 


100 pm. 100 Mm. 


200 urn 200 urn 


WO M m 100 pm 




HMGN1 



B-actin 



Genotyping Western 



D 



ES cell colonies 



Oct3/4 


HMGN1 


Hoechst 


Oct3/4 


HMGN1 


Hoechst 

200 um 


Neural progenitors 


Nestin 


Hoechst 


Merged 

• • ' .Vc/i 


Nestin 


Hoechst 


Merged 

100 urn 




Alkaline phosphatase staining 



Neurons 



Th 


Hoechst 


Merged ' 






* 2^4- 










Hoechst 


Merged - , 

■ 

100 Mm 
i ■ ■ i 



H 



u„.< ™ Hmgnl*/* 
Hmgnl ^HmgnU- 




Nanog , 



I Hmgnl*/* 
1 HmgnU- 



n 



ES NP Neuron 



ES NP Neuron 



Nestin 



■ Hmgn1+/+ 
iHmgnU- 



TrkB 



I Hmgnl +/+ 
] Hmgnl-/- 



Q> 4b0W- 



ES NP Neuron 



ES NP Neuron 



FIG 1 Effects o{ Hmgnl depletion on in vitro ES cell differentiation. (A) HMGN1 protein expression during mouse embryonic preimplantation. Immunoflu- 
orescence images at the stages indicated above each panel are shown. (B) Morphology of Hmgnl +/+ and Hmgnl ~ ES cell colonies during differentiation along 
the neuronal pathway. (C) Genotyping (29) and Western analysis of the ES cells used. (D) Immunofluorescence of Oct3/4, an ES cell marker, in the ES cell 
colonies. (E) Both Hmgnl +,+ and Hmgnl ES cell colonies express the pluripotency marker alkaline phosphatase. (F) Immunofluorescence of the neural 
progenitor nestin. (G) Immunofluorescence of the dopaminergic neuron marker tyrosine hydroxylase (Th). (H) Quantitiative RT-PCR analysis of the expression 
of Hmgnl, Nanog, Nestin, and tropomyosin-related kinase B (TrkB) during Hmgnl +/+ and Hmgnl~'~ ES cell neural differentiation. The values obtained in ES 
cells were set to 100%. 



Wilcoxon rank sum test]). Thus, HMGN1 modulates and fine 
tunes the general fidelity of the cellular transcription profile and 
does not serve as a specific transcription factor that regulates the 
expression of a select subset of genes. 

Reduced nestin-positive cells in the SVZ of Hmgnl" 1 ~ mice. 
The Allen Brain Atlas database reveals prominent Hmgnl expres- 
sion in the SVZ, in the hippocampus, and in the olfactory bulbs 



(Fig. 3A), three regions enriched in neural stem/progenitor cells 
and active in neurogenesis. Indeed, immunofluorescence staining 
of brain sections from a 2-week-old mouse indicated that 
HMGN1 protein was most prominently expressed in the SVZ and 
the hippocampus, where the cells are tightly packed, as indicated 
by Hoechst staining (Fig. 3B). Double immunofluorescence anal- 
ysis of the SVZ regions revealed that nuclei of nestin-expressing 
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FIG 2 HMGN1 depletion affects gene expression during ES cell differentiation along the neuronal pathway. (A) Transcription analysis of Hmgnl +I+ and 
Hmgnl~'~ cells in the ES, NP, and neuron stages. Red dots mark genes (listed in Table S1A in the supplemental material) that were significantly affected by loss 
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visualizes the effects of HMGN1 loss on gene expression during differentiation. 



cells contained significant amounts of HMGN1, an indication that 
HMGN1 was expressed in mouse NPCs (Fig. 3C). 

To test whether loss of HMGN1 affects NPC characteristics in 
the biological context of the entire organism, we compared the 
levels of nestin-positive cells in Hmgnl -1- and Hmgnl +/+ litter- 
mates. Immunofluorescence analysis indicated that loss of 
HMGN1 reduced the number of nestin-positive cells in the SVZ 
region (Fig. 3D), and quantitative RT-PCR analysis of RNA iso- 
lated from the SVZ regions verified that the expression of Nestin 
mRNA in the Hmgnl -1 ~ SVZ was reduced by 40% compared to 
that seen in the Hmgnl +I+ SVZ (Fig. 3E). Thus, loss of HMGN1 
reduces the number of nestin-positive cells in the SVZ region of 
mouse brain. 

HMGN1 preferentially binds to promoters containing CpG 
islands. To gain insights into the mechanisms whereby HMGN1 
affects transcriptional fidelity, we examined the genome-wide dis- 
tribution of HMGN1 in ESCs and NPCs. Chromatin immunopre- 
cipitation sequencing (ChlP-Seq) failed to yield reliable data on 
the localization of HMGN1 in ESCs, most likely because their 
chromatin is hyperdynamic (41, 42) and the chromatin residence 
time of many nuclear proteins, including HMGN1 (see Fig. S3 in 
the supplemental material), is relatively short. In NPCs, however, 
we were able to determine the genome-wide distribution of 
HMGNL 

In NPCs, 72% of the HMGN1 -binding regions were located 
at gene promoter regions and within gene bodies (Fig. 4A). The 
HMGN1 peaks in the promoter regions were significantly more 



intense (P value < 2.2e-16; Wilcoxon rank sum test), an indi- 
cation of preferential association with these regions (Fig. 4B). 
Significantly, we found a striking overlap between HMGN1- 
binding promoters and CGTcontaining promoters. Among all 
24,303 annotated promoters in the NCBI Mouse mm9 Refseq 
database (http://hgdownload.soe.ucsc.edu/goldenPath/mm9 
/database/), there are 15,112 CGI promoters, 10,892 (72%) of 
which are bound by HMGNL Conversely, 10,892 (92%) of all 
11,881 promoters bound by HMGN1 also contain CGI. In con- 
trast, among the 9,181 non-CGI promoters, only 989 (11%) 
bindHMGNl (Fig. 4C). 

The Mouse genome transcription arrays contain 45,101 probes 
which identify 17,392 experimentally validated genes annotated in 
the Mouse Genome Informatics (MGI) database (43) and listed in 
the NCBI Refseq database (44) . The HMGN 1 signal strength at the 
promoters of the 6,956 most actively transcribed (40% of total 
genes [Top40] ) genes is significantly stronger than at those of the 
40% least-transcribed genes (Bot40) (f test P value < 2.2e-16),an 
indication that HMGN1 preferentially associates with transcrip- 
tionally active genes (see Fig. S4 in the supplemental material). 

In the Top40 and Bot40 groups, 6,516 (94%) and 3,416 (46%) 
genes, respectively, contain CGI in their promoters. As shown in 
Fig. 4D, in the Top40 gene group, the average HMGN1 binding 
level was 4-fold higher at CGI promoters (black) than at non-CGI 
promoters (red) (t test P value < 2.2e-16). Furthermore, even in 
the Bot40 group, the HMGN1 binding to CGI promoters (blue) 
was 3 -fold higher than to the non-CGI promoters (t test P value < 
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FIG 3 Decreased nestin-positive cells in the brain of Hmgnl~'~ mice. (A) High Hmgn 1 expression in the subventricular zone (SVZ), hippocampus, and olfactory 
bulbs of mouse brain (from www.brain-map.org with permission). The inset shows the corresponding Nissl image. (B) Immunofluorescence staining in 
2-week-old Hmgnl +,+ brain reveals high HMGN protein levels in mouse hippocampus and SVZ (outlined by dashes). (C) Double immunofluorescence reveals 
colocalization of nestin in the cytoplasm of cells expressing HMGN1 in the SVZ region. A magnified image of the area outlined by dashed lines is shown under 
each panel. (D) Decreased nestin immunofluorescence staining in the SVZ (outlined) of 2-week-old Hmgnl~'~ mice. (E) Decreased Nestin expression in the SVZ 
of Hmgnl ~'~ mice detected by RT-PCR (*, P < 0.01). 



2.2e-16). This distribution pattern was not related to transcrip- 
tion level since, in either the Top 40 or the Bot40 group, the gene 
expression levels were not related to the CGI content in their pro- 
moters (Fig. 4D, inset). Thus, the major driving force of HMGN1 
binding to promoters is the presence of CGI, rather than the gene 
expression level. In agreement, HMGN 1 -binding sites colocalized 
with H3K4Me3 (Fig. 4E and F), a histone mark associated with 
transcription initiation at CGI promoters (45). 

HMGN1 affects nucleosome organization at TSS of CpG 
island-containing promoters in ESCs. The organization of 
nucleosomes in promoters plays an important role in regulating 
gene expression levels. Given that HMGN1 binds specifically to 
nucleosomes (13, 14), we mapped the nucleosome organization in 
Hmgnl +I+ and Hmgnl~'~ ESCs and NPCs by deep sequencing of 



mononucleosomal DNA isolated from micrococcal nuclease di- 
gests. In these digests, the most accessible nucleosomes are re- 
leased first and then further digested, and the corresponding DNA 
may not be included in the mononucleosomal fraction of an ex- 
tensive digest. Therefore, we mapped the genome-wide positions 
of nucleosomes present in either limited or extensive digests 
when, respectively, either only ~5% or more than ~80% of the 
chromatin was converted to mononucleosomes. In these experi- 
ments, we ensured that the wild-type and knockout cells were 
digested to the same extent (Fig. 5A). 

The nucleosome positions seen in extensive digestion of both 
Hmgnl +I+ ESCs and NPCs were very similar to those seen in 
other cells and revealed a region depleted of nucleosome at the 
TSS and flanked by nucleosomes up- and downstream (Fig. 5B). 
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The + 1 nucleosome is the best positioned, as manifested by the 
highest peak at that site. However, analysis of the limited MNase 
digestions clearly reveals the presence of a nucleosome positioned 
at the TSS in the "nucleosome-free" regions seen in extensive di- 
gestions (arrows, Fig. 5B). The linker DNA regions flanking these 
nucleosomes are likely the most accessible to the enzyme, and 
therefore these nucleosomes are the first to be released from the 
chromatin fiber. Since these nucleosomes may contain histone 
variants that render them unstable (46), the corresponding DNA 
fragment is rapidly digested by MNase, resulting in the "nucleo- 
some-free valley" typically seen at the TSS in extensive MNase 
digests. 

In view of the extensive localization of HMGN1 at CGI pro- 
moters, we examined the nucleosome organization at both CGI 
and non-CGI promoters in the Top40 and Bot40 gene groups. We 
found that in both ESCs and NPCs, the nucleosomes in the TSS 
regions containing CGI were significantly better positioned than 
in non-CGI promoters in both extensive and limited digests (Fig. 
5C and D). The nucleosome at the TSS (position 0) is clearly seen 
in the limited digests but not in the extensive digests of CGI pro- 
moters of the Top40 group (Fig. 5D, arrow). 

We separated the CGI promoter genes of ESC and NPC into 8 
groups based on their expression values and mapped the nucleo- 
some positions around the TSS in each group (Fig. 5E and F). In 
the least-expressed group (Tl; red line) the signal from the unsta- 
ble nucleosome at TSS was highest in the extensive digestion but 
lowest in the limited digestion. Conversely, the signal from the 
most-expressed genes was highest in the limited digestions and 
lowest in the extensive digestion (Fig. 5E and F), a finding that is 



fully compatible with the notion that the DNA signal at this 
position is related to transcription level. The nucleosome at 
position + 1 became more stable or better positioned as gene 
expression levels increased. In summary, in TSS regions of the 
promoters containing CGI, the stability of the nucleosomes is 
related to transcription levels; enhanced transcription destabi- 
lizes the nucleosomes at the TSS (position 0) but stabilizes the 
nucleosome at the + 1 site. 

Loss of HMGN 1 increased the peak height of all nucleosomes 
(from —3 to +5) around the TSS of CGI promoters but only in 
limited digests. The effects were not seen in extensive digests or at 
non-CGI promoters (Fig. 5G and H). The increased peak height 
may reflect changes in nucleosome occupancy or accessibility, 
perhaps due to effects of HMGN 1 on chromatin remodeling ac- 
tivities (17) and/or on the binding of linker histone HI to chro- 
matin (47). Thus, HMGN1 affects the position or stability of the 
nucleosomes that are most susceptible to micrococcal nuclease 
digestion, i.e., the nucleosomes located in CGI-containing pro- 
moters of transcriptionally active genes. 

HMGN1 modulates the DNase I hypersensitivity in the CGI 
promoters of ESCs. Changes in nucleosome occupancy may af- 
fect the locations and strengths of the DNase I-hypersensitive sites 
(DHS) which are hallmarks of regulatory regions in chromatin. In 
NPCs, we identified 30,199 DHS and found significant overlap 
between the locations of DHS and HMGN1 (Fig. 6A and B). The 
HMGNl-binding signal at HMGN1-DHS overlapping sites was 
significantly stronger than that at nonoverlapping sites (P value < 
2.2e-16; Wilcoxon rank sum test) (bar graph, Fig. 6C, panels a and 
b), and the HMGN1-DHS overlapping sites were preferentially 
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FIG 6 Colocalization of HMGN1 and DNase I hypersensitivity sites (DHS) in neural progenitor cells. (A) Correlation between HMGN1 intensity and DNase I 
hypersensitivity at all DHS. Average coverage depths (of HMGN1 or hypersensitivity) were calculated at all DNase I-hypersensitive regions. Data, including the 
corresponding DNase I hypersensitivity data points, were sorted by HMGN coverage depth and then grouped into 1 00 data point bins and averaged. The Pearson 
correlation coefficient (R) was calculated for the binned data. (B) Correlation between input signal and DNase hypersensitivity at the same regions. (C) 
Distributions of HMGN 1 peaks that either do or do not overlap DNase HS sites (top two panels) and DNase HS sites that either do or do not overlap HMGN1 
peaks (bottom). Note that the overlapping regions have stronger signal intensities and are mostly located in gene promoters. (D) An example of colocalization 
between HMGN1, DNase I hypersensitivity, and gene promoters. 



located at promoters (pie chart, Fig. 6C, panels a and b). Thus, 
66.5% of the HMGN1 peaks that overlap DHS but only 10.7% of 
the HMGN1 sites that do not overlap DHS localize to promoters. 
Likewise, DHS that overlap HMGN1 have significantly more in- 
tense peaks (P value < 2.2e-16; Wilcoxon rank sum test) and 
localize to promoter regions with a 3-fold higher frequency 
(65.4% versus 22.1%) than DHS that do not overlap HMGN1 
(Fig. 6C panels c and d). 

In Hmgnl +/+ ESCs, we identified 56,156 DHS; 67% of these 
overlapped with the sites in CJ7 mouse ES cells mapped by the 
Mouse Encode Project (see Fig. S5 in the supplemental material). 
In Hmgnl ~'~ ESCs, however, we identified only 17,017 sites; 12% 
of the DHS are new and were not detected in Hmgnl +,+ ESC, and 
88% ( 1 5,042) of the sites overlapped the sites detected in wild-type 
ESCs (Fig. 7A). The DHS retained in Hmgnr'~ ESCs were pref- 
erentially located in CGI promoters: 58% of the 15,022 DHS that 
were located in the CGI promoters in Hmgnl +I+ ESCs were re- 



tained in Hmgnl~ ESCs. In contrast, only 15% of 41,156 DHS 
that were not in CGI promoters were retained. Importantly, the 
intensity of the DHS retained in Hmgnl ~'~ ESCs was significantly 
lowered (f test P value < 2.2e-16) (Fig. 7B andH), especially at the 
sites located in CGI promoters (Fig. 7C). Quantitative PCR analy- 
sis of the kinetics of DNase I digestion of 5 promoters verified that 
loss of HMGN1 decreases the DNase I hypersensitivity of these 
regions (Fig. 7G; see also Table S4 in the supplemental material). 
The DNase I hypersensitivity of two {Actb and Gapdh) housekeep- 
ing gene promoters, which served as controls, was not affected by 
loss of HMGN 1. In NPC, we found that loss of HMGN 1 reduced 
the number of DHS from 30,199 to 23,320 (Fig. 7D). Within the 
23,320 DHS identified in Hmgnl' 1 ' NPCs, 11% were new and 
89% overlapped the sites present in Hmgnl +/+ . Loss of HMGN1 
did not significantly alter the DHS sensitivity of the retained, over- 
lapping sites (Fig. 7E and F). 

Thus, loss of HMGN1 affects the structure of the hyperdy- 



FIG 5 Nucleosome positioning at gene transcription start sites (TSS) in ES and NP cells. (A) MNase digestions of ESCs and NPCs; the sizes of the isolated 
monomers from limited and extensive digests are shown in the bottom panels. (B) Occupancy of nucleosome isolated from limited and extensive digests at the 
TSS. (C and D) Nucleosome occupancy at TSS of Top40 and Bot40 (Bottom40) genes that either do or do not contain CpG islands after extensive digestion (C) 
and limited digestion (D). (Eand F) Correlation between nucleosome positioning at TSS and gene expression levels at ESCs (E) and NPCs (F). Genes were divided 
into 8 groups based on expression values (log 2 ) as shown at the top of panels E and F. (G and H) Effects of HMGN1 depletion on nucleosome occupancy in the 
promoters of ESCs and NPCs. Arrows point to the position of the 0 nucleosome, located just upstream of the TSS. 
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namic chromatin of ESCs to a larger extent than that of the devel- 
opmentally committed chromatin of NPC, a finding fully compat- 
ible with increased nucleosome occupancy seen in limited 
micrococcal nuclease digests in ESCs. Loss of HMGN1 reduces the 
DHS located in intergenic region and gene bodies to a larger ex- 
tent than the sites located at CGI promoters, providing a partial 
explanation for the moderate effects on transcription levels. 

DISCUSSION 

Our studies of ESC chromatin reveal that HMGN1, one of the 
few nuclear proteins that bind specifically to nucleosome cores 
but not to isolated histones or DNA, affects the organization of 
nucleosomes and the DNase I hypersensitivity at active pro- 



moters containing CGI. Loss of HMGN1 has only slight effects 
on the ESC cellular transcription profile and their potential to 
differentiate into neuronal cells; however, in the brain of 
Hmgnl~'~ mice, the levels of neural progenitors in the SVZ 
region are reduced. 

Nucleosome positioning around the TSS of active promot- 
ers plays a key role in gene expression. In limited micrococcal 
nuclease digests, in which only 5% of the genome is converted 
to nucleosome core particles (CPs), we detect a well-positioned 
nucleosome at the TSS in a region considered to be nucleosome 
depleted. The presence of this nucleosome is especially obvious 
in the CGI promoters of transcriptionally active genes in ESCs 
(Fig. 5D). The nucleosomes at the TSS of transcribed genes are 



3386 mcb.asm.org 



Molecular and Cellular Biology 



HMGN1 Affects ESC Chromatin 



likely to be relatively unstable (46) and most accessible to mi- 
crococcal nuclease; they are rapidly released and then further 
digested, leading to nucleosome-free regions which are most 
clearly delineated at TSS of genes containing CGI (25). Our 
analysis of the limit digests reveals a correlation between tran- 
scription levels and nucleosome occupancy at the TSS. The 
occupancy is highest in the most transcribed genes and lowest 
in the least transcribed genes, suggesting that the nucleosomes 
from the most transcriptionally active genes are the most rap- 
idly released during digestion. However, once released, these 
nucleosomes, which may contain H2Az variants, are unstable; 
they are rapidly digested and, therefore, in the extensive micro- 
coccal nuclease digests, the promoters of the most active genes 
show the most prominent nucleosome-free region. Thus, our 
studies reveal that the stability of nucleosomes at the TSS is 
inversely related to transcription levels, since the nucleosomes 
from the most active genes are most abundant in limit digests 
and least abundant in extensive digests, while the nucleosomes 
from the least transcribed genes are most abundant in extensive 
digests and least abundant in limit digests (compare Tl to T8 in 
Fig. 5E and F). 

Loss of HMGN1, which preferentially binds to TSS containing 
CGI (Fig. 4), stabilizes the position of the nucleosomes at these 
TSS — an unexpected finding, since HMGN proteins stabilize the 
structure of isolated nucleosomes (48) and loss of the protein 
would therefore be expected to sensitize the released nucleosomes 
to further digestion. Several mechanisms could account for the 
effects of HMGN1 on nucleosome positioning and stability. First, 
because HMGN1 competes with histone HI (47),lossofHMGNl 
would increase HI binding, thereby decreasing the ability of mi- 
crococcal nuclease to access its initial digestion sites in the linker 
DNA (8) and subsequently further degrade the released nucleo- 
some (49). Second, because HMGN1 reduces the chromatin in- 
teraction of nucleosome remodeling complexes (17), loss of 
HMGN1 could enhance the ability of these complexes to position 
nucleosomes at the TSS. Third, HMGN1 can transiently enhance 
the levels of histone acetylation and destabilize nucleosomes dur- 
ing transcription activation (50); therefore, its loss could enhance 
the stability of a nucleosome at the TSS. Fourth, because HMGN1 
is found in metastable multiprotein complexes (51) and interacts 
with chromatin modifiers such as PCNA (52) and procyclic acidic 
repetitive protein 1 (PARP1) (53),lossofHMGNl could affect the 
binding of these proteins to chromatin. Since nucleosome posi- 
tioning is determined by the coordinated action of several factors, 
including DNA-binding proteins (18), it remains to be seen 
whether one or several of these factors are most specifically af- 
fected by HMGN1. 

A possible link between the presence of HMGN and the gener- 
ation of DHS in globin genes has been previously suggested (16); 
however, the extent to which HMGN proteins shape regulatory 
sites in chromatin is not known. We now find that in NPCs, 
HMGN1 colocalizes extensively with DHS genome wide in a fash- 
ion similar to that seen in human T cells (20). In both NPCs and 
human T cells, the hypersensitivity of the DHS that colocalize with 
HMGN1 is stronger than that of the sites that do not colocalize 
with HMGN1 (Fig. 6). Furthermore, in both NPCs andT cells, the 
positional overlap of HMGN 1 with DHS is especially striking in 
the promoter region. In NPCs, 65% of the DHS that overlap 
HMGN1 but only 22% of the sites that do not overlap are located 
in the promoter region (Fig. 6C, panels c and d). Likewise, in the 



promoter region of human T cells, the frequency of DHS that 
overlap HMGN1 is 3 -fold higher than that of sites that do not 
overlap HMGN1 (28% versus 10%) (20). The preference for 
HMGN1 to colocalize with the DHS in promoters is even more 
striking. In NPCs, 66% of the HMGN1 sites that overlap DHS but 
only 1 1 % of the sites that do not overlap DHS localize to promoter 
regions. In T cells, the ratio is 32% versus 2% (20). These studies 
raise the possibility that preferential colocalization of HMGN1 
with DHS in promoter regions is a general characteristic of verte- 
brate chromatin. 

Loss of HMGN1 leads to a significant reduction in DHS, espe- 
cially in the hyperdynamic chromatin of ESCs, where the number 
of sites is reduced from 56,156 in Hmgnl +,+ to 17,017 in 
Hmgnl ~'~ ESCs. The DHS retained in Hmgnl ~'~ ESCs preferen- 
tially localize at promoters containing CGI, providing a partial 
explanation for the relative small changes in transcription seen in 
these cells. Loss of HMGN 1 reduces not only the number but also 
the hypersensitivity of the retained DHS (Fig. 7B), suggesting that 
HMGN1 plays a role in maintaining their DNase I sensitivity. 
Likewise, in NPCs, loss of HMGN1 reduces the number of DHS 
(Fig. 7D), an expected finding given their extensive colocalization 
and the observation that the presence of HMGN 1 at a site affects 
its DNase I sensitivity (Fig. 6). Similarly to ESCs, the retained sites 
localize to CGI promoters; however, in contrast to ESCs, the 
DNase I hypersensitivity at these sites is not markedly changed, 
perhaps because their chromatin is less malleable. 

Thus, loss of HMGN1 in ESCs affects two important aspects of 
chromatin organization: nucleosome positioning at the TSS and 
DNase I hypersensitivity. These effects are moderate, perhaps be- 
cause HMGN1 functions within a dynamic framework of chro- 
matin-binding proteins that compete for similar sites on nucleo- 
somes. Thus, the absence of HMGN 1 could facilitate the binding 
of another HMGN variant, especially to sites with high affinity for 
HMGN proteins, thereby minimizing changes in chromatin 
structure and activity. Indeed, preliminary experiments suggest 
that loss of HMGN1 induces reorganization in the chromatin 
binding of the closely related variant HMGN2. 

Given that HMGN1 is expressed in most vertebrate cells, binds 
specifically to nucleosomes, localizes to DHS, and affects the bind- 
ing of linker HI to chromatin, its loss would be expected to have 
significant effects on transcription and the cellular phenotype. Yet 
our present experiments and analysis of MEFs from mice lacking 
HMGN variants indicate that loss of a single HMGN has only 
minor effects on the cellular transcription profile (21). As elabo- 
rated above, functional redundancy among HMGN variants and 
other compensatory events in chromatin may minimize the dele- 
terious effects of a lack of any specific variant, including HMGN 1 . 
Yet several types of experiments, including analysis of Hmgn~'~ 
mice, reveal that lack of a single HMGN variant may lead to spe- 
cific phenotypes. Thus, Hmgn3~'~ mice are diabetic (54) and 
Hmgnl~ mice are hypersensitive to DNA damage and have an 
increased incidence of certain cancers (29, 55). The emerging data 
suggest that each HMGN variant contributes to the maintenance 
of the fidelity of the cellular transcription profile and that loss of a 
variant is deleterious, especially under stress condition such as 
exposure to heat shock (50) or DNA-damaging agents (29, 53, 55). 
Indeed, we now find that even though the Hmgnl -1- ESCs seem 
to grow and differentiate normally, they are hypersensitive to the 
DNA-damaging agent doxorubicin (Adriamycin) (see Fig. S6 in 
the supplemental material). Likewise, the decrease in nestin-pos- 
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itive cell numbers in the SVZ region of Hmgnl~ mice suggests 
that while the Hmgnl~'~ ESCs are not visibly affected in tissue 
culture, in the context of the entire organism, loss of HMGN1 
impedes the differentiation or the survival of NP cells. Loss of 
HMGN1 also alters the behavior of mice, most likely by changing 
expression of MeCp2, a protein that is highly expressed in neurons 
(37). Our studies link these phenotypic changes to HMGNl-me- 
diated alterations in chromatin structure. 
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